Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning

نویسندگان

Sébastien Forestier

Yoan Mollard

Pierre-Yves Oudeyer

چکیده

Intrinsically motivated spontaneous exploration is a key enabler of autonomous lifelong learning in human children. It allows them to discover and acquire large repertoires of skills through self-generation, self-selection, self-ordering and self-experimentation of learning goals. We present the unsupervised multi-goal reinforcement learning formal framework as well as an algorithmic approach called intrinsically motivated goal exploration processes (IMGEP) to enable similar properties of autonomous learning in machines. The IMGEP algorithmic architecture relies on several principles: 1) self-generation of goals as parameterized reinforcement learning problems; 2) selection of goals based on intrinsic rewards; 3) exploration with parameterized time-bounded policies and fast incremental goal-parameterized policy search; 4) systematic reuse of information acquired when targeting a goal for improving other goals. We present a particularly efficient form of IMGEP that uses a modular representation of goal spaces as well as intrinsic rewards based on learning progress. We show how IMGEPs automatically generate a learning curriculum within an experimental setup where a real humanoid robot can explore multiple spaces of goals with several hundred continuous dimensions. While no particular target goal is provided to the system beforehand, this curriculum allows the discovery of skills of increasing complexity, that act as stepping stone for learning more complex skills (like nested tool use). We show that learning several spaces of diverse problems can be more efficient for learning complex skills than only trying to directly learn these complex skills. We illustrate the computational efficiency of IMGEPs as these robotic experiments use a simple memory-based low-level policy representations and search algorithm, enabling the whole system to learn online and incrementally on a Raspberry Pi 3.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

2018-00413 - Post-doctoral - Unsupervised learning with deep nets for intrinsically motivated exploration of dynamical systems

The Flowers team studies computational mechanisms allowing robots and humans to acquire openended repertoires of skills through life-long learning. This includes the processes for progressively discovering their bodies and interaction with objects, tools and others. In particular, we study mechanisms of intrinsically motivated learning (also called curiosity-driven active learning), autonomous ...

متن کامل

2018-00413 - Post-doctoral - Unsupervised learning with deep nets for intrinsically motivated exploration of dynamical systems

متن کامل

Unsupervised Learning of Goal Spaces for Intrinsically Motivated Goal Exploration

Intrinsically motivated goal exploration algorithms enable machines to discover repertoires of policies that produce a diversity of effects in complex environments. These exploration algorithms have been shown to allow real world robots to acquire skills such as tool use in high-dimensional continuous state and action spaces. However, they have so far assumed that self-generated goals are sampl...

متن کامل

2018-00413 - Post-doctoral - Unsupervised learning with deep nets for intrinsically motivated exploration of dynamical systems

متن کامل